Word-Level Optical Font Recognition Using Typographical Features

نویسندگان

  • Soo-Hyung Kim
  • Hee K. Kwag
  • Ching Y. Suen
چکیده

Previous research efforts on optical font recognition have mostly limited applications since they deal with only a few types of font attributes and estimate them from a line or block of text. This paper proposes a word-level optical font recognition system for printed Korean and English documents. At the word-level, it has the advantages of obtaining more detailed font attributes including the following: script (Korean and English), font style (regular, bold, italic, and underlined), typeface (Myung-jo and Gothic), point size (10, 12, 14 pts), and word length (2, 3, 4, 5 for Korean, and 4 to 10 for English). A hierarchical classifier and several typographical features have been devised for the system, and their effectiveness are proven by an experiment with a database of 100 sets of 264 font categories.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multifont Classification using Typographical Attributes

This paper introduces a multifont classification scheme to help recognition of multifont and multisize characters. It uses typographical attributes such as ascenders, descenders and serifs obtained from a word image. The attributes are used as an input to a neural network classifier to produce the multifont classification results. It can classify 7 commonly used fonts for all point sizes from 7...

متن کامل

Optical Font Recognition Using Typographical Features

A new statistical approach based on global typographical features is proposed to the widely neglected problem of font recognition. It aims at the identification of the typeface, weight, slope and size of the text from an image block without any knowledge of the content of that text. The recognition is based on a multivariate Bayesian classifier and operates on a given set of known fonts. The ef...

متن کامل

A study on font-family and font-size recognition applied to Arabic word images at ultra-low resolution

In this paper, we propose a new font and size identification method for ultra-low resolution Arabic word images using a stochastic approach. The literature has proved the difficulty for Arabic text recognition systems to treat multi-font and multi-size word images. This is due to the variability induced by some font family, in addition to the inherent difficulties of Arabic writing including cu...

متن کامل

Hidden Markov Models in Text Recognition

1 Abstract A multi-level multifont character recognition is presented. The system proceeds by rst delimiting the context of the characters. As a way or enhancing system performance, typographical information is extracted and used for font identiication before actual character recognition is performed. This has the advantage of sure character identiication as well as text reproduction in origina...

متن کامل

Spanish Journal of Psychology, in press Does bold emphasis facilitate the process of visual-word recognition?

Does bold emphasis facilitate the process of visual-word recognition? Abstract The study of the effects of typographical factors on lexical access has been rather neglected in the literature on visual-word recognition. Indeed, current computational models of visual-word recognition employ an unrefined letter feature level in their coding schemes. In a letter recognition experiment, Pelli, Burns...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJPRAI

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2004